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TEAM  THEORY  AND  DECENTRALIZED  RESOURCE  AL LOCATION:  AN  EXAMPLE* 


by 

Kenneth  J .  Arrow 


1.  Introduction. 

The  traditional  discussion  of  the  price  system  and  alternative 
forms  of  decentralized  resource  allocation  in  organizations  and  entire 
economies  has  an  ambivalent  attitude  to  the  ease  of  transferring  infor¬ 
mation  from  one  locus  in  the  economic  system  to  another.  On  the  one 
hand,  the  very  need  for  decentralization  is  based  on  the  assumption  that 
the  transmission  of  information  is  costly.  If  this  were  not  so,  there 
would  be  no  reason  not  to  transfer  all  information  on  the  availability 
of  resources  and  the  technology  of  production  to  one  place  and  compute 
at  one  stroke  the  optimum  allocation  of  resources.^/'  On  the  other  hand, 
the  literature  has  tended  to  seek  algorithms  which,  in  some  sense, 
minimize  the  amount  of  information  transferred  but  which  at  the  same 
time  yield  in  the  end  the  fully  optimal  allocation  of  resources.^  In 
short,  there  is  no  true  trade-off  between  information  costs  and  other 
resource  costs.  If  there  were,  one  would  expect  that  an  optimal  allo- 
cation  of  resources,  taking  account  of  information  costs,  would  differ 
from  the  optimal  allocation  in  the  absence  of  information  costs.  The 
standard  tradition  can  be  rationalized  only  by  assuming  that  information 
costs  are  infinitesimal  but  not  zero;  hence,  they  should  be  minimized  to 
the  extent  possible  without  affecting  the  overall  allocation .2/ 

*This  research  was  supported  by  the  Office  of  Naval  Research  Grant 
0NR-N0001U-79-C-0685  at  the  Center  for  Research  on  Organizational 
Efficiency,  Stanford  University. 


The  theory  of  teams  was  introduced  by  Jacob  Marschak  precisely  to 
bring  information  costs  into  the  allocation  process  explicitly;  see  J. 
Marschak  [  1955 1  »  J*  Marschak  and  Radner  [1972]  .  It  does  so  in  a  way 
which  is  polar  to  the  standard  tradition.  It  assumes  a  fixed  amount  of 
communication  in  fixed  channels.  The  "costs”  of  communication  are 
modelled  by  scarcity. 

Team  theory  differs  in  other  ways  from  the  standard  approach.  It 
makes  more  use  of  prior  information  about  the  economy*  In  the  usual 
form  of  the  price-adjustment  or  quant ity-adjustment  iterative  processes 
for  achieving  an  optimal  allocation  of  resources,  the  rule  design  uses 
only  the  broadest  qualitative  information  about  the  economy.  There  are 
no  assumptions  about  the  likely  shapes  of  the  production  functions  or 
the  range  of  possible  levels  of  resource  supplies Jt/  In  team  theory, 
some  or  all  of  the  basic  parameters  of  technology  or  resources  are 
unknown;  otherwise,  there  would  be  no  informational  problem  at  all.  But 
there  is  prior  information  in  the  sense  that  probabilities  are  attached 
to  different  possible  values  of  the  parameters.  The  decisions  made 
under  decentralization  can  then  take  advantage  of  this  knowledge  and 
reduce  the  probability  of  a  bad  decision  as  far  as  possible. 

There  is  still  another  difference  between  the  standard  approach 
and  team  theory,  which  follows  from  the  fact  that  there  are  irreducible 
differences  in  information  among  the  members  of  the  team.  In  the  stan¬ 
dard  approach,  the  decisions  are  ultimately  all  made  by  the  central 
authority,  the  individual  parts  being  only  sources  of  information.  In  a 
sense,  since  relevant  information  is  eventually  equalized,  it  does  not 
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really  matter  vho  makes  the  decisions.  In  team  theory,  as  in  real  life, 
the  allocations  are  ultimately  the  results  of  many  individual  deci¬ 
sions.  The  decentralization  is  real. 

One  implication  among  others  is  that  the  rules  must  be  designed  to 
insure  feasibility  without  a  full  exchange  of  information  on  resource 
supplies . 

The  general  form  of  the  team  problem,  then,  is  this:  The  system 
has  a  number  of  agents,  each  charged  with  making  certain  decisions.  The 
system's  operations  are  governed  by  a  number  of  parameters,  initially 
unknown  to  the  agents.  There  is  a  probability  distribution  over  these 
parameters,  reflecting  prior  knowledge.  Each  agent  is  given  some  infor¬ 
mation  on  some  of  the  parameters.  That  is,  the  probability  distribution 
of  the  parameters  is  conditioned  on  the  agents'  items  of  information. 
Each  agent  then  makes  a  decision  within  its  competence  as  a  function  of 
the  information  available  to  it.  The  team  problem  is  to  choose,  in 
advance,  the  rules  or  decision  functions  for  all  agents.  These  decision 
functions  are  determined  Jointly  to  optimize  the  expected  outcome,  which 
is  a  function  of  the  true  values  of  the  parameters  and  of  the  decisions 
made. 

I  will  not  try  here  to  repeat  in  more  detail  the  general  formula¬ 
tion  of  the  team  problem,  which  can  be  found  in  J.  Marschak  and  Radner 
(1972] .  The  structure  will  be  sufficiently  clear  from  the  specific 
example  to  be  analyzed.  This  example  is  very  simple  but  requires  suffi¬ 
cient  analysis  to  indicate  the  nature  of  the  team  theory  approach  to 
resource  allocation  and  the  problems  that  need  to  be  solved  in  applying 
it. 


-li¬ 
ve  assume  a  known  production  structure,  indeed,  the  simple  one  of 
fixed  coefficients.  What  are  unknown  a  priori  are  the  resource  sup¬ 
plies.  Each  resource  manager  knows  the  supply  of  his  or  her  own 
resource,  and  is  required  to  divide  it  between  the  general  production 
using  all  supplies  and  a  specialized  alternative  use.  The  paper  is 
devoted  primarily  to  establishing  the  decision  miles  for  the  resource 
managers  which  yield  the  optimal  resource  allocation  for  the  given 
information  structure. 

A  modification  of  the  model  would  permit  the  resource  managers  to 
transmit  incomplete  information  about  resource  holdings.  In  that  case, 
the  center,  having  received  the  information,  would  then  issue  decision 
rules  to  the  resource  managers.  The  extension  is  in  fact  very  straight¬ 
forward.  The  calculation  of  the  benefits  from  the  additional  informa¬ 
tion  is  straightforward  in  principle  but  does  not  lend  itself  to  simple 
expression  in  formulas.  These  benefits  should  be  compared  with  the 
costs  of  the  additional  information. 

2.  Formal  Statement  of  the  Model. 

We  are  assumed  to  have  n  resources.  These  resources  can  be  used 
Jointly  to  produce  a  product.  The  production  structure  is  defined  by 
fixed  coefficients,  i.e.,  each  unit  of  output  requires  a  fixed  amount  of 
each  resource.  By  choosing  units  properly,  we  can  assume  that  the  fixed 
coefficients  are  all  1,  i.e.,  that  it  requires  1  unit  of  each 
resource  to  produce  1  unit  of  the  output.  Let, 
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y  =  output, 

=  amount  of  ith  resource  available, 
x^  =  amount  of  i^*1  resource  used  in  production  of  output. 

Then, 

(2.1)  y  =  min  (x^,...,x^)  -  min  x^ 

i 

Clearly,  if  the  only  use  of  the  resources  is  in  the  production  of 
this  product,  there  is  no  interesting  resource  allocation  problem.  The 
entire  amount  are  delivered  to  the  production  unit;  no  resource 

manager  should  be  affected  by  the  fact  that  he  or  she  does  not  know  the 
amounts  of  the  other  resources.  Notice  that  resources  other  than  the 
ones  in  least  supply  are  useless  in  production;  but  if  there  are  no 
alternative  uses,  that  does  not  matter. 

To  make  the  allocation  interesting,  therefore,  assume  that  each 
resource  has  another  use,  with  a  unit  v? iue  which  is  constant  relative 
to  the  Joint  product.  Let, 

w^  =  unit  value  of  resource  in  alternate  use, 
with  Joint  product  as  numeraire  . 

The  aim  of  the  team  is  to  achieve  as  large  a  value  of, 
n 

y  +  l  -  x')  > 

i=l  1  1  1 


as  possible 
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If  the  resource  availabilities  were  known,  this  problem  would 

have  an  easy  solution.  It  must,  of  course,  satisfy  the  feasibility 
constraints , 

(2.2)  0  <  x±  <  all  i  . 

The  problem  would  have  a  totally  trivial  solution  if, 
n 

l  >  1  . 

i=l  " 

for  in  that  case,  it  would  always  be  at  least  as  good  to  engage  only  in 
specialized  use  of  the  resources.  To  see  this,  consider  any  feasible 
allocation.  Now  consider  a  new  one  which  simply  shuts  down  the  joint 
production.  The  value  of  the  specialized  uses  of  the  resources 
increases  by,  V  w  x!,  while  the  amount  of  the  joint  produce  decreases 

«L  X 

by ,  min  x ' .  Hence ,  the  net  gain  is , 
i 

n  n  n 

l  w.x'  -  min  xj  >  V  w.  min  x'  -  min  x!  *  (  f  w.  -  l)  min  xl  >  0 

i=l  i  1  ”  i=l  1  i  1  i  i=l  1  i 

I.e,  there  is  never  any  loss  in  shutting  down  Joint  production.  In  this 
case,  then,  the  decision  rules  for  the  individual  resource  managers  are 
simple:  use  all  the  resources  for  the  specialized  use.  This  rule  can 

be  carried  out  by  the  resource  managers  without  any  knowledge  of  each 
other's  resources  availabililties. 

Hence,  we  assume, 

n 

(2.3)  l  v  <  1  . 

i-1  1 


-7- 


If  there  is  complete  information,  then  clearly  the  optimal  policy 

would  be  to  set,  x'  =  min  x.  for  each  i.  That  is,  we  would  want  to 

J  J 

invest  as  much  as  possible  of  each  resource  in  the  general  production 
process,  subject  to  the  non-wastefulness  condition  that  an  equal  amount 
of  each  resource  be  used.  This  is  because  increasing  the  allocation  of 
each  resource  to  the  general  process  by  1  unit,  if  feasible,  increases 
output  of  the  community  by  1  and  decreases  the  total  value  in  the  spe¬ 
cialized  processes  by  £w^. 

This  allocation,  however,  requires  that  each  resource  manager  know 
the  amounts  of  all  other  resources  (or  at  least  the  minimum  of  all  other 
resource  availabilities).  Let  us  state  explicitly  our  decision  and 
information  assumptions.  Beforehand,  when  the  rules  are  to  be  deter¬ 
mined,  each  of  the  variables  x^  is  assumed  to  be  a  random  variable, 
with  a  distribution  known  to  all.  For  simplicity,  I  willl  assume  that 
these  variables  are  independent  of  each  other.  The  resources  become 
available,  but  for  each  resource  only  the  manager  knows  the  availabil¬ 
ity.  Because  of  the  independence  assumption,  no  manager  has  any  more 
information  about  the  availability  of  other  resources  than  he  or  she  did 
before  the  realization  of  the  random  variables .  The  manager  has  then  to 
determine  the  amount  allocated  to  the  general  production  process,  in 
accordance  with  the  rules  laid  down. 

The  allocation  that  finally  results  depends  on  the  resources 
actually  available;  it  must  in  fact  satisfy  the  resource  feasibility 
conditions  (2.2).  But  it  must  also  satisfy  what  may  be  termed  the 
conditions  of  informational  feasibility;  a  decision  cannot  depend  on 
more  information  than  is  available  to  the  decision-maker. 
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The  last  condition,  stated  formally,  is  that  xj!,  the  amount  of 
resource  i  devoted  to  general  production,  can  depend  only  on  x^,  the 
amount  of  resource  i  available,  for  this  is  all  the  information  avail¬ 
able  to  the  manager  for  resource  i.  The  allocation  x|  cannot  depend 

on  the  availabilities  of  any  resources  x.  for  J  *  i.  The  team 

J 

allocation  problem  then  requires  the  choice  of  function,  x^(x^), 
satisfying  (2.2). 

(2.4)  0  <  x^(x^)  <  x^,  for  all  values  of  x^^  . 

Finally,  what  is  the  maximand  of  the  team  problem?  Since  we 
regard  the  availabilities,  x^,  as  random  variables,  we  must  seek  to 
maximize  the  expected  payoff  for  each  individual  realization, 

n  n 

y  +  I  w.(x.  -  x')  =  min  x!(x. )  +  l  w, [x.  -  x!(x. )]  . 

i=l  1  1  1  i  1  1  i=l  1  1  1  1 

Therefore,  we  can  finally  state  the  team  version  of  the  allocation 
problem  as  follows: 

Choose  n  functions,  x|(x^),  the  i^h  function  depending  on  the 
quant ity  x^  alone,  so  as  to  maximize, 

n 

(2.5)  W  =  Efmin  x'(x. )  +  l  w. (x.  -  x'(x. )]}  , 

i  i=l  1  1  1 

subject  to  the  constraints  (2.4). 

It  may  be  useful  to  remark  that  the  standard  price-adjustment  and 
quantity-adjustment  procedures  amount  to  full  information  revelation  in 
the  present  context.  Since  resource  supplies  are  inelastic,  at  the  very 
first  announcement  of  prices,  the  full  set  of  supplies  is  announced  in 


W 
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the  price-adjustment  procedure.  As  for  the  quantity -adjustment 
procedure,  this  depends  on  the  center's  knowing  how  much  of  each 
resource  is  available  and  hence  presupposes  full  information.  Thus,  in 
the  present  simple  example,  the  standard  methods  of  decentralized 
resource  allocation  require  full  information  by  the  end  of  the  first 
round  of  approximations. 

3.  The  Form  of  the  Optimal  Team ‘Policy. 

Optimizing  on  functions  is  intrinsically  more  difficult  tt 
optimizing  on  variables;  and  the  problem  is  complicated  here  by  the  fa 
that  the  functions  are  defined  over  specified  variables,  differing  fr 
one  to  another.  In  general,  therefore,  team  theory  problems  are  diff. 
cult  to  solve.  But  in  this  case,  the  functions  can  be  shown  to  have 

some  relatively  simple  forms,  and  the  problem  can  be  reduced  to  an 
optimization  over  ordinary  real-valued  variables. 

If  we  have  chosen  our  policy  optimally,  then  it  must  also  be  that 
the  decision  rule  for  any  one  manager  must  be  optimal,  given  the  rules 
for  all  other  managers.  Consider  manager  1.  The  policies,  x!^x^) ,  of 
all  managers  with  i  >  1  are  taken  as  given.  Since  x^  is  independent 
of  x^,  for  i  >  1,  it  is  also  true  that  xj(x^)  is  independent  of 

for  i  >  1.  Hence,  manager  1  may  regard  the  allocations,  xj,  of 
other  managers  as  given  random  variables,  independently  of  the  observed 
variable,  x^. 

Observe  also  that,  to  choose  an  optimal  function  or  rule,  x|(x^) 
is  really  the  same  as  choosing  an  optimal  value  for  x^  for  each  given 
value  of  x^,  since  x1  is  a  known  magnitude  to  manager  1. 
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We  will  rewrite  (2.5)  so  as  to  separate  the  terms  involving  x^ 
from  the  others,  over  which  manager  1  has  no  influence.  We  use  the  fact 
that  the  expectation  of  a  sum  is  the  sum  of  the  expectations.  First, 
write 


(3-l)  W  =  E(min  x'  -  J  w.x!]  +  E[  £  v.x.] 
i  i=l  1  1  i=l  1  1 


The  last  term  is  independent  of  any  manager's  actions;  it  is  Just  a 
constant  which  does  not  affect  the  choice  of  the  optimal  team  decision 
rules.  Therefore,  write  the  first  terms  separately: 


(3.2)  W  =  E[min  x'  -  j  w.x!] 

1  i  1  i=l  1  1 

It  is  W^  that  is  to  be  maximized  by  choice  of  the  decision  functions. 


x^(xi) . 


Now,  to  consider  the  optimization  problem  for  manager  1  alone. 


write. 


x '  *  min  x! 

-1  i>i  1 


Then, 


min  x!  =  min  (x',x*  ) 
i  1  1-1 


Write  (3*2)  as  follows: 


W],  =  Elmin  (x'.x^)]  -  w^  -  E[  \  w^j] 


(3.3) 
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From  the  viewpoint  of  manager  1,  the  values  of  x|  for  i  >  1 
are  given  random  variables.  Hence,  the  third  term  is  a  constant;  in  the 
first  term,  is  taken  as  a  given  random  variable.  Finally,  x-^  is 

known  to  manager  1.  It  does  not  enter  explicitly  in  (3.3)  at  all,  but 
it  does  set  an  upper  bound  to  the  choice  of  x^,  through  the  feasibility 
constraint  (2.2)  for  i  =  1: 


(3.U)  0  <  x^  <  xx  . 


The  manager's  aim  is  to  choose  x^  to  maximize  (3*3)  subject  to 
(3*1*)*  In  technical  terms,  (3*3)  is  a  concave  function  of  the  decision 
variable  x^,  that  is,  the  marginal  contribution  of  x|  decreases.  We 
use  a  well-known  general  result,  recapitulated  in  the  Appendix  to  this 
article:  If  x  is  any  random  variable  and  x*  a  number  to  be  chosen, 
then, 

<3.5)  dEl.mln  U*,x)l  .  ^  (x  „  . 


Apply  this  to  the  differentiation  of  (3*3),  with  x*  replaced  by  x|, 
x  by  x '  : 


fl 

dxi 


=  Prob 


(xll 


>  x')  - 


As  x|  increases,  the  probability  of  exceeding  it  necessarily 
decreases.  As  x^  becomes  very  small,  Prob  (x'^  >  x|)  tends  to  1,  so 
that  dW^/dx^  tends  to  1  -  w^  in  view  of  assumption  (2.3).  this  must 
be  positive.  As  x^  becomes  large,  Prob  (x'^  >  x^)  approaches  0,  so 
that  dW^/dx^  approaches  -  w^  <  0.  Hence,  there  is  an  intermediate 


point,  say  x*,  at  which  the  function  achieves  a  maximum.  It  is 

increasing  up  to  x*  and  decreasing  after  that. 

This  gives  the  unconstrained  maximum  for  W^.  However,  we  have  to 
take  account  of  the  upper  bound  constraint,  (3*4).  Clearly,  if 
x^  >  x*.  Then  the  manager  should  choose  x* ,  the  unconditional  maximum; 
while  if  x^  <  x*,  the  manager  wants  to  choose  x-^,  since  is 

increasing  up  to  that  point.  In  summary,  the  optimal  decision  rule  for 
manager  1  has  the  form, 

x'(x  )  =  min  (x*^)  . 

That  is,  there  is  a  fixed  number  x*  independent  of  x^  which  defines 
the  optimal  rule. 

Since  the  same  argument  holds  for  any  manager,  not  merely  the 
first,  we  have  established  that  the  optimal  decision  rules  have  form, 

(3.6)  x^(xj.)  =  min  (x*^)  . 

These  results  show  that  the  problem  of  choosing  n  functions, 
x'(x  ),  has  been  reduced  to  that  of  choosing  n  numbers,  x*. 

We  will  substitute  (3.6)  into  (3*5)  and  then  optimize  with  respect 
to  the  parameters  x*.  Because  of  (3.6),  the  feasibility  constraints 
(2.4)  are  automatically  satisfied.  To  simplify  the  resulting  expres¬ 
sion,  introduce  the  new  notation, 

x*  =  min  x?  , 

j  * 


x  *  min  x. 


Note  that  x  is  a  random  variable  of  known  distribution,  since  it  is 
the  minimum  of  n  independent  random  variables  whose  distributions  are 
known.  Then,  from  (3.6), 


min  x'(x. )  =  min  min  (x*,x. )  =  min  (min  x*,min  x. )  =  min  (x*,x) 
i  i  i  i  1 

n 

W  =  E(min  (x*,x)  -  J  w  min  (x*,x  )) 

i-1 

Suppose  the  x*'s  are  not  all  equal,  say,  x*  >  x*.  By  definition  of 
x*,  a  small  decrease  in  x*  will  leave  x*  unchanged,  but  it  will 
decrease  min  (x*,x1)  for  at  least  some  values  of  x-^.  The  change 
therefore  increases  the  expression  in  brackets  for  some  values  of  the 
random  variables  and  decreases  it  for  none,  so  that  is  increased. 

Hence  at  the  optimum,  x*  =  x*,  and,  in  general,  x*  =  x*  for  all  i. 

The  optimization  problem  has  now  been  reduced  to  the  choice  of  a 
single  number.  Choose  x*  to  maximize, 

n 

(3*7)  W  (x»)  =  E[min  (x*,x)  -  l  w  min  (x*,x.)) 

i=l 

n 

=  E(min  (x*,x)j  -  ^  w.E[min  (x*,x  )] 

i=l 

Then  the  optimal  decision  rule  for  the  ith  manager  is, 

(3-8)  x^(xt)  =  min  (x»,xi)  . 

k.  The  Determination  of  the  Optimal  Decision  Rule . 

It  remains  then  only  to  determine  x*  so  as  to  maximize  (3»7)« 
The  procedure  is  straightforward;  the  derivative  of  with  respect 


to  x*  is  equated  to  zero.  From  (3«5)»  we  see  how  to  differentiate 
each  of  the  terms  in  (3.7). 


dV  n 

-T- fC  *  Prob  (x  >  x*)  -  y  v.  Prob  (x.  >  x*) 
ax  i=1  1  1 

For  simplicity,  introduce  the  following  notation: 

P(y)  =  Prob  (x  >  y)  , 

P^y)  ■  Prob  (xi  >  y^  . 

These  functions  are  like  the  well-known  cumulative  distribution  func¬ 
tions,  except  that  they  are  cumulated  from  above  rather  than  below. 
Also,  let 


W|(x*) 


dx* 


Then, 


n 

W'(y)  =  P(y)  -  T  w  P  (y)  . 

i=l 

We  now  use  the  assumption  that  the  random  variables  x ^  are  independent 
to  express  P(y)  in  terms  of  the  probabilities  Pi(y).  Indeed,  the 
assertion,  x  >  y,  is  by  definition,  equivalent  to  the  assertion, 
min  x^  >  y,  and  this  in  turn,  by  the  definition  of  a  minimum,  holds  if 
and  only  if  x^  >  y  for  all  i.  But  the  events,  x^  >  y,  for  any  fixed 
y,  are  independent  events,  so  the  probability  of  their  joint  occurrence 
is  the  product  of  the  probabilities  of  their  individual  occurrences.  In 
symbols , 
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P(y)  =  Prob  (x  >  y)  =  Prob  (min  x.  >  y)  =  Prob  (x,  >  y  for  all  i) 

i  1 

n  n 

=  n  Prob  (x.  >  y)  =  n  P. (y)  , 

i=l  i=l 

n 

where  the  symbol,  H  ,  means,  "product  as  i  runs  from  1  to  n." 

i=l 

The  hypothesis  of  independence  is  used  only  in  the  fourth  equality.  We 
have  then, 

(U.l)  W(y)  =  n  P  (y)  -  ”  w  P  (y)  . 

i=l  i=l 

We  know  that  x*  satisfies  the  condition,  W^(x*)  =  0,  but  we  need  to 
show  that  such  a  root  exists  and  in  fact  gives  the  maximum.  In  particu¬ 
lar,  it  should  be  unique,  at  least  in  some  relevant  range;  otherwise  the 
equation  may  have  several  solutions,  only  one  of  which  is  the  true 
optimum. 

Let  x  be  the  smallest  value  of  y  such  that  Pi(y)  =  o  for 

some  i.  It  is  obvious  that  one  would  never  want  to  set  x*  higher 

than  that,  for  the  contribution  of  the  resource  for  which  P^x)  =  0 

will  never  be  more  than  x,  and  any  allocation  of  any  other  resource 

beyond  x  will  be  certainly  wasted.  In  the  Appendix,  the  following  is 
demonstrated:  within  the  range  from  0  to  x ,  there  is  in  ordinary 

circumstances  precisely  one  value  of  x*  for  which, 

W^(x*)  *  0  , 

idle re  W^  is  defined  in  (U.l),  and  this  value  of  x*,  used  in  (3.8), 
defines  the  optimal  team  rules  for  the  managers. 
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5.  Remarks  on  Operating  Characteristics  and  Partial  Information 
Transfer. 

Given  the  optimum  policy,  it  is  always  possible  to  determine  its 
performance  by  substituting  back  into  the  expected  payoff  function,  W, 
as  given  by  (2.5)*  This  can  easily  be  done  numerically  in  any  given 
case,  but  no  simply  interpretable  general  formula  seems  available. 

Within  the  framework  of  the  problem  as  formulated,  there  is  no 
especial  need  to  determine  the  performance  or  operating  characteristics 
of  the  team  decision  rule.  However,  it  would  be  very  useful  if  one  were 
to  contemplate  alternative  team  descriptions,  in  particular  the  intro¬ 
duction  of  partial  or  total  information  transfer.  In  the  latter  case, 
one  could  calculate  the  operating  characteristics  for  complete  informa¬ 
tion,  where  the  optimal  policy  is  to  set, 

x'  =  min  x  , 

J  J 

as  noted  before.  Of  course,  this  rule  will  give  a  higher  expected 
payoff.  However,  there  is  presumably  a  cost  to  the  provision  of 
complete  information,  and  this  should  be  compared  with  the  gain  in 
payoff.  It  is  for  this  reason  that  one  would  want  to  compute  operating 
characteristics . 

We  consider  briefly  the  case  of  partial  information  transfer. 
Suppose  each  resource  manager  provides  the  center  with  a  random  variable 
which  is  a  signal  for  the  availability  of  that  resource.  By  a  signal 
will  be  meant  a  random  variable  whose  distribution  is  conditional  on  the 
actual  value  of  the  resource  but  independent  of  anything  else,  in 
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particular  of  the  availabilities  of  other  resources.  By  Bayes'  Theorem, 
then,  the  center  can  compute  a  conditional  distribution  of  the  resource 
availability,  x^,  for  each  i,  which  differs  in  general  from  the  prior 
distribution.  The  Center  can  compute  x*  as  before  but  using  the 
conditional  distributions,  and  transmit  that  value  as  an  instruction  to 
each  of  the  resource  managers.  The  new  ceiling  on  allocations  to  gen¬ 
eral  production,  x*,  is  now  a  function  of  the  signals  transmitted.  In 
the  extreme  case  where  the  transmitted  signal  is  x^  itself,  we  have 
complete  information,  and  the  value  of  x*  transmitted  will  be  the 
optimal  value  under  complete  information. 

An  important  but  difficult  optimization  problem  is  the  choice  of 
signals,  in  other  words,  of  the  amount  of  information  transfer  to  the 
center.  Clearly  some  signals  are  more  informative  than  others;  that  is, 
they  convey  knowledge  of  the  resource  availabilities  more  or  less  accu¬ 
rately  JL/  Hence,  use  of  a  more  informative  signal  will  yield  a  higher 
expected  payoff.  Presumably ,  however,  more  informative  signals  will 
tend  to  be  more  expensive.  Hence,  there  is  a  need  for  comparing  the 
expected  gains  with  the  expected  additional  costs.  While  the  expected 
gains  can  be  computed  in  any  given  case,  as  already  seen,  little  appar¬ 
ently  can  be  asserted  of  a  general  nature. 


The  proof 8  of  two  assertions  were  left  10  this  Appendix. 

The  first  is  ( 3 «5 )  •  Let  x  be  any  random  variable  and  x*  any 


fixed  number.  We  seek  the  derivative  of  E{min  (x*,x)]  with  respect 
to  x*. 

An  expectation  is  essentially  an  integral.  We  can  differentiate 
it  by  differentiating  under  the  integral  sign.  Now  min  (x*,x)  =  x* 
if  x  >  x*  and  =  x  if  x  <  x*.  Hence 


d  min  (x*,x) 
dx* 


1  if  x  >  x*  , 
0  if  x  <  x*  . 


The  derivative  is  undefined  when  x  =  x*.  However,  if,  as  we  have  been 
implicitly  assuming,  x  is  a  continuous  random  variable,  the  probability 
that  it  takes  on  any  particular  value,  x*,  is  zero,  so  the  lack  of 
definition  of  the  derivative  at  one  point  does  not  affect  the  expecta¬ 
tion. 

We  also  note  that  if  a  function  of  a  random  variable  is  1  on 

some  range  of  x-values  say  the  set  E,  and  0  elsewhere,  then  the 

expectation  of  that  function  is  the  probability  of  E.  Hence, 

2.  lafnjvy)!  „  E  [d  .  Prob  (x  „  > 

as  asserted. 

The  second  assertion  left  for  proof  in  this  Appendix  is  that  the 

function,  W|(y),  as  defined  in  (4.1),  has  precisely  one  zero  as  y 
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varies  from  0  to  x,  where  x  is  the  smallest  value  of  y  for 

which  P^(y)  =  0  for  some  i.  (it  is  not  excluded  that  P^(y)  >  0  for 
all  i  and  all  y,  in  which  case  we  may  take  x  to  be  Infinite.)  0- 

For  y  <  x,  P^(y)  >  0  for  all  i,  and  therefore, 

n 

(A.l)  H  P  (y)  >  0  . 

i=l  V 


Let, 


(A.2) 


V(y) 


W£(y) 


n 

n  P. (y) 


?  Pi(y)  ,  ?  vi 

/,yi- - *  1  *  *  -5 - 

1=1  n  P  (y)  1-1  n  P  (y) 

J*1  J  J*i  J 


From  the  definition  of  Pj^y),  as  the  probability  that  a  random 
variable  is  greater  than  y,  it  must  be  true  that  P^  is  monotone 
decreasing  as  y  increases.  It  is  also  true  that  P^  is  non-negative 
and  in  fact  positive  so  long  as  y  <  x.  Clearly,  then,  the  product  of 
any  number  of  P^s  is  positive  and  monotone  decreasing.  In  particu¬ 
lar,  for  each  i,  the  function, 


n 

n  P,(y)  , 

is  positive  and  monotone  decreasing.  Hence,  V(y)  is  monotone  decreas¬ 
ing.  We  also  observe  that  Pj  (0)  =  1,  from  the  definition  (resource 
availability  is  certainly  positive),  so  that 


»  ' 


V(0)  =  1  -  l  w±  >  0 


J ,  so  that,  from  (A. 2),  V(y)  approaches  negative  infinity.  Therefore, 
V(y)  must  equal  zero  at  least  once  in  the  interval  from  0  to  x. 

Since  V(y)  is  monotone  decreasing,  it  would  ordinarily  be  true 
that  it  has  exactly  one  zero.  Strictly  speaking,  however,  it  is  conceiv¬ 
able  that  a  monotone  decreasing  function  is  zero  on  a  whole  interval. 

Thus  V(y)  is  positive  for  y  small,  0  at  a  point  or  on  an 
interval,  and  then  negative.  By  construction,  W|(y )  has  the  same  sign 
(positive,  zero,  or  negative)  as  V(y).  Hence,  W-^  increases  for 
small  y,  reaches  a  maximum,  which  may  be  at  a  point  or  over  an 
interval,  and  then  decreases. 

Therefore  any  solution  of  the  equations,  V(y)  *  0,  or  W^(y)  =  0, 
may  be  taken  as  the  value  of  the  operating  parameter,  x*,  which  deter¬ 
mines  the  optimal  team  decision  policy. 


For  the  purposes  of  this  paper,  I  assimilate  computation  and  its 
costs  to  information  transfer.  The  economics  of  computation  has 
so  far  not  been  well  integrated  with  that  of  information.  For  an 
early  and  interesting  essay,  see  T.  Marschak  ( 1959)* 

Actually,  there  is  rarely  any  explicit  measurement  of  the  informa¬ 
tion  costs  of  alternative  algorithms  for  achieving  optimal 
resource  allocation.  Most  of  the  work  is  satisfied  with  exhibit¬ 
ing  a  process  which  manifestly  requires  a  good  deal  less  than 
complete  information  transfer.  For  an  explicit  comparison  of 
alternative  algorithms  according  to  the  amount  of  information 
transfer  needed,  see  Oniki  [1974]  • 

What  I  have  called  the  standard  tradition  of  decentralized 
achievement  of  optimal  resource  allocation  goes  back  to  Adam 
Smith's  "invisible  hand,"  and  continues  through  Walras  [1954,  pp. 
84-86 ,  90-91,  105-106,  169-172,  243-254,  and  184-195 1 ,  Pareto 
[1927,  pp.  233-234]  ,  Barone,  pp.  245-290  in  von  Hayek  [  1935 1  ,  J. 
Marschak  (1924],  Lange,  pp.  57-142  in  Lippincott  [1938],  and  the 
more  modern  literature  surveyed  in  Hurwicz,  pp.  3-37  in  Arrow  and 
Hurwicz  (1977] •  Most  of  this  literature  has  emphasized  the  para¬ 
metric  role  of  prices  and  the  successive  adjustment  of  prices  in 
accordance  with  excess  demands  and  supplies.  A  variant  strain  has 
emphasized  quantity  allocations  and  their  revision  so  as  to  bring 
the  shadow  prices  of  each  resource  in  different  uses  into  equal¬ 
ity;  see  especially  Kornai  and  Lipt&k  [1965] ,  Marglin  [1969],  and 
Arrow  [1976]  .  A  very  good  survey  of  the  entire  literature  is 
found  in  Heal  [ 1973 1 • 

It  should  be  added  that  Hurwicz,  pp.  393-412  in  Arrow  and  Hurwicz 
[1977] ,  and  other  papers  has  given  a  more  abstract  version  of  the 
process  of  successive  adjustments  which  includes  the  price-adjust¬ 
ment  and  quantity-adjustment  types  of  algorithms  as  special  cases. 
However,  it  has  in  common  with  the  rest  of  the  literature  that 
information  costs  do  not  appear  explicitly  and  that  the  process 
seeks  a  fully  optimal  allocation  disregarding  such  costs. 

Indirectly,  however,  knowledge  of  the  economy  may  be  used  to 
determine  the  starting  point  of  the  iterative  process.  In  prac¬ 
tice,  of  course,  the  length  of  the  iterative  process  may  be  very 
much  reduced  by  a  suitable  initial  condition. 

The  relation  between  signals,  "more  informative  than,"  is  defin¬ 
able  in  different  ways.  In  general,  it  cannot  be  an  ordering, 
that  is,  given  any  two  signals,  it  is  not  necessarily  true  that 
one  is  more  informative  than  the  other.  The  most  general  useful 
definition  is  that  of  Blackwell  [1953].  With  this  definition,  it 
is  always  true  that  a  more  informative  signal  will  yield  a  higher 
expected  payoff. 
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